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Abstract 

Description Logics (DLs) are a family of knowledge representation for- 
malisms mainly characterised by constructors to build complex concepts 
and roles from atomic ones. Expressive role constructors are important 
in many applications, but can be computationally problematical. 

We present an algorithm that decides satisfiability of the DL AjCC ex- 
tended with transitive and inverse roles and functional restrictions with 
respect to general concept inclusion axioms and role hierarchies; early ex- 
periments indicate that this algorithm is well-suited for implementation. 
Additionally, we show that ACC extended with just transitive and inverse 
roles is still in PSpace. We investigate the limits of decidability for this 
family of DLs, showing that relaxing the constraints placed on the kinds 
of roles used in number restrictions leads to the undecidability of all infer- 
ence problems. Finally, we describe a number of optimisation techniques 
that are crucial in obtaining implementations of the decision procedures, 
which, despite the hight worst-case complexity of the problem, exhibit 
good performance with real-life problems. 



1 Motivation 

Description Logics (DLs) are a well-known family of knowledge representation 



formalisms [Doniiii et at, 199£|. They are based on the notion of concepts 



*This paper appeared in the Logic Journal of the IGPL 8(3):239-264, May 2000. 



1 



(unary predicates, classes) and roles (binary relations), and are mainly char- 
acterised by constructors that allow complex concepts and roles to be built 
from atomic ones. Sound and complete algorithms for the interesting inference 
problems such as subsumption and satisfiability of concepts are known for a 
wide variety of DLs. 

Transitive and inverse roles play an important role not only in the adequate 
representation of complex, aggregated objects [ Borrocks fc Sattlcr, 199E ] , but 
also for reasoning with conceptual data models [ galvancsc et ai, 199 -If . More- 
over, defining concepts using general concept inclusion axioms seems natural 
and is crucial for representing conceptual data models. 

The relevant inference problems for (an exte nsion of) ACC augmented in th e 

described manner are known to be decidablc | Dc Giacomo fc Lcnzcrini, 1996 1, 

and worst-case optimal inference algorithms have been described | De Giacomo fc Massacci, 1999 . 
However, to the best of our knowledge, nobody has found efficient means to deal 
with their high degree of non-determinism, which so far prohibits their use in 
realistic applications. This is mainly due to the fact that these algorithms 
can handle not only transitive roles but also the transitive closure of roles. It 



has been shown [Battler, 199£] that restricting the DL to transitive roles can 
lead to a lower complexity, and that transitive roles, even when combined with 
role hierarchies, allow for algorithms that behave quite well in realistic applica- 
tions [Eorrocks, 1998b|. However, until now it has been unclear if this is still 
true when inverse roles are also present. 

In this paper we present various aspects of our research in this direction. 
Firstly, we motivate our use of logics with transitive roles instead of transitive 
closure by contrasting algorithms for several pairs of logics that differ only in 
the kind of transitivity supported. 

Secondly, we present an algorithm that decides satisfiability of ACC extended 
with transitive and inverse roles, role hierarchies, and functional restrictions. 
This algorithm can also be used for checking satisfiability and subsumption 
with respect to general concept inclusion axioms (and thus cyclic terminologies) 
because these axioms can be "internalised" . The fact that our algorithm needs 
to deal only with transitive roles, instead of transitive closure, leads to a lower 
degree of non-determinism, and experiments indicate that the algorithm is well- 
suited for implementation. 

Thirdly, we show that ACC extended with both transitive and inverse roles 
is still in Pspace. The algorithm used to prove this result introduces an en- 
hanced blocking technique that should also provide useful efficiency gains in 
implementations of more expressive DLs. 

Fourthly, we investigate the limits of decidability for this family of DLs, 
showing that relaxing the constraints we will impose on the kind of roles allowed 
in number restrictions leads to the undecidability of all inference problems. 

Finally, we describe a range of optimisation techniques that can be used 
to produce implementations of our algorithms that exhibit good typical case 
performance. 
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2 Preliminaries 



In this section, we present the syntax and semantics of the various DLs that 
are investigated in subsequent sections. This includes the definition of inference 
problems (concept subsumption and satisfiability, and both of these problems 
with respect to terminologies) and how they are interrelated. 

T he logics we will discuss are all bas ed on an extension of the well known DL 
ACC [Schmidt-Schaufi fc Smolka, 1991 to include transitively closed primitive 



roles [ Battler, 1996 ] ; we will call this log ic S due to its relationship with the 
propositional (multi) modal logic S4( m ) |3child, 1991 ].[] This basic DL is then 



extended in a variety of ways — see Figure |l] for an overview. 
Definition 2.1 

Let Nq be a set of concept names and R a set of role names with transitive role 
names R+ C R. The set of Si-roles is RU {R~ | R £ R}. To avoid considering 
roles such as R , we dehne a function Inv on roles such that lnv(i?) = R~ if R 
is a role name, and lnv(i?) = S if R — S~ . In the following, when speaking of 
roles, we refer to ST-roles, as our approach is capable of dealing uniformly with 
both role names and inverse roles. 

Obviously, a role R is transitive iff lnv(f?) is transitive. We therefore define 
Trans to return true iff R is a transitive role. More precisely, Trans(f?) = true 
(and we say that R is transitive) iff R £ R + or lnv(i?) £ R + . 

The set of <SI-concepts is the smallest set such that 

1. every concept name is a concept, and, 

2. if C and D are concepts and R is an SX-role, then (C\1D), (CUD), (->C), 
(V-R.C), and (3R.C) are also concepts. 

A role inclusion axiom is of the form R U S, where R and S are two roles, 
each of which can be inverse. A role hierarchy is a finite set of role inclusion 
axioms, and STLX is obtained from SI by allowing, additionally, for a role 
hierarchy 1Z. The sub-role relation E is the transitive-reflexive closure of U 
over K U {lnv(i?) U Inv(S) | R E S £ K}. 

SJiTQ is obtained from SJiT by allowing, additionally, for qualiGed number 
restrictions \Hollunder & Baader, 199l§ , i.e., for concepts of the form ^nR.C 
and ^nR.C, where R is a simple role, C is a concept, and n £ N. A role is 
called simple iff it is neither transitive nor has transitive sub-roles. STCIN is the 
restriction ofSTCIQ allowing only unqualified number restrictions (i.e., concepts 
of the form ^nR and ^nR), while STCIT represents a further restriction where, 
instead of arbitrary number restrictions, only functional restrictions of the form 
^lf? and their negation ^2R may occur. 

An interpretation X = (A 1 , • I ) consists of a set A 1 , called the domain of I, 
and a function - 1 which maps every concept to a subset of A 1 and every role 

1 This logic has previously been called ACC^ , but this becomes too cumbersome when 
adding letters to represent additional features. 
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Construct Name 


Syntax 


Semantics 




atomic concept 


A 


A L C A^ 




atomic role 


R 


R 1 C A x x A x 




transitive role 


R e R+ 


R 1 = (R x )+ 




conjunction 


CUD 


c x r\D x 




disjunction 
negation 


CUD 
-.C 


C X UD X 
A X \C X 




exists restriction 


3R.C 


{x | 3y.(x, y) G R x and y € C 1 } 




value restriction 


VR.C 


{x Vy.(x, y) € i? x implies y G C x } 




role hierarchy 


RUS 


R 1 C ^ 


W 


inverse role 


R- 


| (y,a?> e J^} 


X 


number 


^nR 


{x | |t{y.<a:,V> 6i^} >n} 


A/" 


restrictions 




{x\t{y.{x,y) £R x }^n} 


qualifying number 


^nR.C 


{x | ${y.(x,y) 6 i? 2 and i/EC 1 }^ n} 


Q 


restrictions 


s^nR.C 


{ir tt{y.(x,?/) G i? x and y G C x } < n} 





Figure 1 : Syntax and semantics of the 6>Z family of DLs 

to a subset of A x x A x such that, for all concepts C , D, roles R, S, and non- 
negative integers n, the properties in Figure |I| are satisfied, where j}M denotes 
the cardinality of a set M. An interpretation satisfies a role hierarchy TZ iff 
R x C S x for each R C S G TZ; we denote this fact by X |= TZ and say that X is 
a model of TZ. 

A concept C is called satishable with respect to a role hierarchy TZ iff there 
is some interpretation X such that X \= TZ and C x ^ 0. Such an interpretation 
is called a model of C w.r.t. TZ. A concept D subsumes a concept C w.r.t. 
TZ (written C D) iff C x C D x holds for each model X of TZ. For an 
interpretation X, an individual x G A x is called an instance of a concept C iff 
x G C x . 

All DLs considered here are closed under negation, hence subsumption and 
(un)satisfiability w.r.t. role hierarchies can be reduced to each other: C C-r. D 
iff Cn-i-D is unsatisfiablc w.r.t. TZ, and C is unsatisfiable w.r.t. TZ iff C Q-r A\1—iA 
for some concept name A. 



In [jKozcn fc Tiuryn, 1990j |Baadcr, 1990| , |Schild, 199l| |Baader et al, 19931 , 



the inter nalisation of terminological axioms is introduced, a technique that re- 
duces reasoning with respect to a (possibly cyclic) terminology to satisfiability 



of concepts. In [Horrocks, 1998t], we saw how role hierarchies can be used for 
this reduction. In the presence of inverse roles, this reduction must be slightly 
modified. 

Definition 2.2 

A terminology T is a finite set of general concept inclusion axioms, T = {C\ C 
D\, . . . , C n C D„}, where Ci, Di are arbitrary STiXT '-concepts. An interpre- 
tation X is said to be a model of T iff C x C D x holds for all d C Di G T . 
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A concept C is satisfiable with respect to T iff there is a model X of T with 
C x 7^ 0. Finally, D subsumes C with respect to T iff, for each model X of T, 
we have C 1 C D 1 . 

The following lemma shows how general concept inclusion axioms can be in- 
ternalised using a "universal" role U, a transitive super-role of all roles occurring 
in T and their respective inverses. 

Lemma 2.3 Let T be a terminology, TZ a role hierarchy, and C, D STiXT- 

concepts, and let 

Let U be a transitive role that does not occur in T,C, D, or 1Z. We set 

TZu := TZ U {R C U, \m(R) \R occurs in T, C, D, or TZ}. 

Then C is satisfiable w.r.t. T and TZ iff C l~l C<r n VU.Ct is satisfiable w.r.t. 
TZu- Moreover, D subsumes C w.r.t. T and TZ iff C n ->D n Ct n VC/.Ct is 
unsatisfiable w.r.t. IZjj. 



The proof of Lemma 2.3 is similar to the ones that can be found in [Bchild, 1991 
Baader, 1990 1 . Most importantly, it must be shown that, (a) if a ^HZ^-concept 
C is satisfiable with respect to a terminology T and a role hierarchy TZ, then C, 
T, and TZ have a connected model, and (b) if y is reachable from x via a role path 
(possibly involving inverse roles) in a model of T and TZu, then (x,y) G C/ J . 
These are easy consequences of the semantics and the definition of U. 

Theorem 2.4 

Satisfiability and subsumption of SHX J 7 -concepts (resp. SHX-concepts) w.r.t. 
terminologies and role hierarchies are polynomially reducible to (un ) satisfiability 
of STil J- -concepts (resp. SHX-concepts) w.r.t. role hierarchies. 



3 Blocking 

The algorithms we are going to present for deciding satisfiability of SX- and 



iSHZF-concepts use the tableaux method [ Hollunder et at, 199C ] , in which the 
satisfiability of a concept D is tested by trying to construct a model of D. The 
model is represented by a tree in which nodes correspond to individuals and 
edges correspond to roles. Each node x is labelled with a set of concepts £(x) 
that the individual x must satisfy, and edges are labelled with (sets of) role 
names. 

An algorithm starts with a single node labelled {D}, and proceeds by repeat- 
edly applying a set of expansion rules that recursively decompose the concepts 
in node labels, new edges and nodes being added as required in order to satisfy 
3R.C or (^ 2 F) concepts. The construction terminates either when none of 
the rules can be applied in a way that extends the tree, or when the discovery 
of obvious contradictions demonstrates that D has no model. 
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In order to prove that such an algorithm is a sound and complete decision 
procedure for concept satisfiability in a given logic, it is necessary to demonstrate 
that the models it constructs are correct with respect to the semantics, that it 
will always find a model if one exists, and that it always terminates. The 
first two points can usually be dealt with by proving that the expansion rules 
preserve satisfiability, and that in the case of non-deterministic expansion (e.g., 
of disjunctions) all possibilities are exhaustively searched. For logics such as 
ACC, termination is mainly due to the fact that the expansion rules can only 
add new concepts that are strictly smaller than the decomposed concept, so the 
model must stabilise when all concepts have been fully decomposed. As we will 
see, this is no longer true in the presence of transitive roles. 



3.1 Transitive Roles vs. Transitive Closure 

We have argued that reasoning for logics with transitive roles is empirically more 



tractable than fo r logics that allow for transitive closure of roles [Sattler, 1996 



Horrocks, 19981: |. In this section we will give some justifi cation for that cl aim 



The st arting point for our investigations are the logics STi. { Horrocks, 19981: ] and 



ACC+ [ Baader, 1990 , which extend ACC by transitive roles and role hierarchies 
or transitive closure of roles respectively. Syntactically, ACC+ is similar to S, 
where, in addition to transitive and non-transitive roles, the transitive closure 
R + of a role R may appear in existential and universal restrictions. Formally, 
R + is interpreted by 



(R+ f = (J (R x )\ where (R 1 ) 1 = 



R x , if i = 1 

R x o(R I y~ 1 J otherwise 



For both SH and ACC+, concept satisfiability is an ExPTiME-complete 
problem. This result is easily derived from the ExPTiME-hardness proof for 
PP L in [Fischer fc Ladner, 1979( 1 and from the proof that PDL is in Exptime 



in [Pratt, 1979 1 . Nevertheless, implementations of algorithms for SH exhibit 



good performance in realistic applications [Horrocks & Patel-Schneider, 1999 1 
whereas, at the moment, this seems to be more problematical for ACC+. We 
believe that the main reason for this discrepancy, at least in the case of tab- 
leau algorithm implementations, lies in the different complexity of the blocking 
conditions that are needed to guarantee the termination of the respective algo- 
rithms. In the following we are going to survey the blocking techniques needed 
to deal with SH and its subsequent extensions to SHT and SHIT. To underpin 
our claim that reasoning with transitive roles empirically leads to more efficient 
implementations than for transitive closure, we will also present the blocking 
techniques used to deal with transitive closure. These are more complicated 
and introduce a larger degree in non-determinism into the tableaux algorithms, 
leading to inferior performance of implementations. 



G 



3.2 Blocking for S and SH 



Termination of the expansion process of a tableaux algorithm is not guaranteed 
for logics that include transitive roles, as the expansion rules can introduce new 
concepts that are the same size as the decomposed concept. In particular, \/R.C 
concepts, where R is a transitive role, are deal t with by propagating the whole 
concept across i?-labelled edges [ Battler, 1996 ]. For example, given a node x 
labelled {C, 3R.C, VR.(3R.C)}, where R is a transitive role, the combination of 
the 3R.C and \/R.(3R.C) concepts would cause a new node y to be added to 
the tree with a label identical to that of x. The expansion process could then 
be repeated indefinitely. 

This problem can be dealt with by blocking: halting the expansion process 
when a cycle is detected [ Baader, 1990| , Buchheit et al, 1993 . For logics with- 
out inverse roles, the general procedure is to check the label of each new node 
y, and if it is a subset [Baader et al, 1996 of the label of an ancestor node x, 
then no further expansion of y is performed: x is said to block y. The resulting 
tree corresponds to a cyclical model in which y is identified with x. 

To deal with the transitive closure of roles, tableaux algorithms proceed by 
non-deterministically expanding a concept 3R + .C to either 3R.C or 3R.3R + .C . 
Again, since the size of concepts along a path in the tree may not decrease, block- 
ing techniques are necessary to guarantee termination. An adequate blocking 
condition for ACC+ is identical as for S7i : but one has to distinguish between 
good and bad cycles. Consider the following concept: 



D = 3R+.A n \/R + .^A n -iA 



While D is obviously not satisfiable, a run of a tableaux algorithm might gen- 
erate the following tableau in which node y is blocked by node x without gen- 
erating any obvious contradictions. 

. x 3R+.A, VR+.^A, 3R.3R+A, -iA 

R 

l y 3R+A, MR+.^A, 3R.3R+A, -*A 

The problem is that 3R + .A has always been expanded to 3R.3R + A, postponing 
the satisfaction of A a further step. To obtain a correct tableaux algorithm for 
ACC+ , the blocking condition must include a check to ensure that each concept 
3R + .C appearing in such a cycle is expanded to 3R.C somewhere in the cycle. 
Such cycles are called good cycles, whereas cycles in which 3R + .C has always 
been expanded to 3R.3R + .C are called bad cycles. A valid model may only 
contain good cycles. 

Summing up, using transitive closure instead of transitive roles has a twofold 
impact on the empirical tractability: (a) in blocking situations, good cycles have 
to be distinguished from bad ones, and (b) the non-deterministic expansion of 
concepts of the form 3R + .C increases the size of the search space. 
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nA 3R.3R+.C 



3R+.C, 3R.C 



C 



, x ->A, 3R.3R+.C 



R 



, v 3R+.C, 3R.3R+.C 



3R+.C3R.3R+.C 



Figure 2: Dynamic blocking fails in the presence of transitive closure. 



3.3 Adding Inverse Roles 

Blocking is more problematical when inverse roles are added to the logic, and 
a key feature of the algorithms presented in [ HorrocksfeSattler, 1999 1 was the 
introduction of a dynamic blocking strategy. Besides using label equality in- 
stead of subset, this strategy allowed blocks to be established, broken, and 
re-established. With inverse roles the blocking condition has to be considered 
more carefully because roles are now bi-directional, and additional concepts in 
x's label could invalidate the model with respect to y's predecessor. This prob- 
lem can be overcome by allowing a node x to be blocked by one of its ancestors 
y if and only if they were labelled with the same sets of concepts. 

Dealing with inverse roles is even more complicated in the presence of tran- 
sitive closure. As an example consider the following concept: 

D = n 3R.3R + .C 
C = VR-.{VR-.A) 

Fig. H shows two possible tableau expansions of the concept D. Continuing 
the expansion of the left hand tree will necessarily lead to a clash when concept 
C G &(z) is expanded as this will lead to both A and -iA appearing in L(x). 
The right hand tree is also invalid as it contains a bad cycle: L(y) = L(z) but 
3R + .D has always been expanded to 3R.3R + .D. Nevertheless, C is satisfiable, 
as would be shown by continuing the expansion of the right hand path for one 
more step. 



In | Dc Giacomo fe Massacci, 1999 ], a solution to this problem for CPDL, a 
strict superset of ACC2+ (ACC+ plus inverse roles) is presented. The solution 
consists of an additional expansion rule called the look behind analytical cut. 
This rule employs exhaustive non-deterministic guessing to make the past of 
each node in the tree explicit in the labelling of that node: if y is an i?-successor 
of a node x, then 3R~ .C or Vi? _ .-iC is added non-deterministically to the label 
of y for each concept C that may appear during the expansion process. Obvi- 
ously, this leads to a further large increase in the size of the search space, with 
a correspondingly large adverse impact on empirical tractability. Experience 
with this kind of exhaustive guessing leads us to believe that an implementa- 
tion of such an algorithm would be disastrously inefficient. The non-existence 
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of implementations for ACCI+ or CPDL might be taken to support this view. 



3.4 Pair-wise Blocking 

Further extending the logic STil to SHIT by adding functional restrictions 
(concepts of the form 1 R), meaning that an individual can be related to at 
most one other individual by the role R) introduces new problems associated 
with the fact that the logic no longer has the finite model property. This means 
that there are concepts that are satisfiable but for which there exists no finite 
model. An example of such a concept is 

-.C n 3F-.(C n « 1 F)) n wr .(3F~.(C n « l f))), 

where R is a transitive role and F C R. Any model of this concept must 
contain an infinite sequence of individuals, each related to a single successors by 
an F~ role, and each satisfying C Fl 3F~.C, the 3F~ .C term being propagated 
along the sequence by the transitive super-role R. Attempting to terminate the 
sequence in a cycle causes the whole sequence to collapse into a single node due 
to the functional restrictions IF), and this results in a contradiction as both 
C and -iC will be in that node's label. 

In order to deal with infinite models — namely to have an algorithm that ter- 
minates correctly even if the input concept has only infinite models — a more so- 



phisticated pair-wise blocking strategy was introduced in plorrocks fc Sattler, 1999 1 
and soundness was proved by demonstrating that a blocked tree always has a 
corresponding infinite model.n 

The only known algorithm that is able to deal with the combination of tran- 
sitive closure, inverse roles, and functional restrictions on roles relies on an elabo- 



rate polynomial reduction to a CPDL terminology De Giacomo & Lcnzcrini, 1994] 
and the capability of CPDL to internalise the resulting general terminological 
axioms. The large number and the nature of the axioms generated by this re- 
duction make it very unlikely that an implementation with tolerable runtime 
behaviour will ever emerge. 



4 Reasoning for ST Logics 



In this section, we present two tableaux algorithms: the first decides satisfiability 
of <SHZ". F-con cepts, and can be used for all SHIT reasoning problems (see 
Theorem 2.4); the second decides satisfiability (and hence subsumption) of SI- 
concepts in Pspace. In this paper we only sketch most of the proofs . For details 
on the <SWT.F-algorithm, please refer to |Horrocks fc Sattler, 1999 , for details 



on the SI- and SZAA-algorithm, please refer to |Horrocks et aL, 1998 



The correctness of the algorithms can be proved by showing that they create 
a tableau for a concept iff it is satisfiable. 

For ease of construction, we assume all concepts to be in negation normal 
form (NNF), that is, negation occurs only in front of concept names. Any 

2 This is not to say that it may not also have a finite model. 



9 



SHTT-concept can easily be transformed to an equivalent one in NNF by push- 
ing negations inwards [Hollundcr et at, 1990 . 



Definition 4.1 

Let D be a SUIT -concept in NNF, 1Z a role hierarchy, and R.£> the set of roles 
occurring in D together with their inverses, and sub(D) the subconcepts of D. 
Then T = (S,£,£) is a tableau for D w.r.t. 1Z iff S is a set of individuals, 
£ : S — > 2 sub(D ^ maps each individual to a set of concepts, £ : R_d — ► 2 s xS 
maps each role to a set of pairs of individuals, and there is some individual 
s £ S such that D G £(s). Furthermore, for all s,t G S, C,E g sub(D), and 
R,SeK D , it holds that: 

1. ifC G £(«), then -.C $ L(s), 

2. ifCnEe £(s), then C* 6 £(s) and E g £(s), 

3. if C U £ g £(s), then C G £(s) or E e £(s), 

4. ifVR.C G £(s) and {s,t) G £(#), then C G £(*), 

5. if 3R.C G £(s), then there is some i G S such that (s,t) G £(i?) and 
C G £(t), 

6. if VS.C G £(s) and (s,t) G E(R) for some RUtS with Trans(i?), then 
VR.CeL(t), 

7. (s,t) G £(i?) iff{t,s) G £(lnv(i?)). 

8. if{x,y) G £(i?) and R^S, then (x,y) G £(5), 

9. if <LR G £(s), then )j{f | (a,t) G £(i?)} < 1, and 
10. if >2J? G £(s), then ${t \ (s,t) G £(f?)} > 2. 

Tableaux for SX-concepts are defined analogously and must satisfy Proper- 
ties 1-7, where, due to the absence of a role hierarchy, H is the identity. 

Due to the close relationship between models and tableaux, the following 
lemma can be easily proved by induction on the structure of concepts. As a 
consequence, an algorithm that constructs (if possible) a tableau for an input 
concept is a decision procedure for satisfiability of concepts. 

Lemma 4.2 A SUIT '-concept (resp. SX-concept) D is satisfiable w.r.t. a role 
hierarchy 1Z iff D has a tableau w.r.t. 1Z. 

4.1 Reasoning in SJiTT 

In the following, we give an algorithm that, given a SUIT -concept D, decides 
the existence of a tableaux for D. We implicitly assume an arbitrary but fixed 
role hierarchy 1Z. 
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Definition 4.3 

A completion tree for a STilJ- -concept D is a tree where each node x of the 
tree is labelled with a set L(x) C sub(D) and each edge (x, y) is labelled with a 
set L((x,y)) of (possibly inverse) roles occurring in sub(D). 

Given a completion tree, a node y is called an f?-successor of a node x iff y 
is a successor of x and S £ £((x, y}) for some S with S E i?. A node y is called 
an f?-neighbour of x iff y is an R-successor of x, or if x is an \nv(R)-successor 
of y. Predecessors and ancestors are defined as usual. 

A node is blocked iff it is directly or indirectly blocked. A node x is directly 
blocked iff none of its ancestors are blocked, and it has ancestors x' , y and y' 
such that 

1. x is a successor of x' and y is a successor of y' and 

2. £j(x) = L(y) and L(x') — L(y') and 

3. H({x>,x))=Jl((y>,y)). 

In this case we will say that y blocks x. 

A node y is indirectly blocked iff one of its ancestors is blocked, or — in order 
to avoid wasted expansion after an application of the ^-rule — it is a successor 
of a node x and &((x, y)) — 0. 

For a node x, L(x) is said to contain a clash iff {A, ->A} C L(x) or{^2R, ^15} C 
&(x) for roles RHiS. A completion tree is called clash- free iff none of its nodes 
contains a clash; it is called complete iff none of the expansion rules in Figure 
is applicable. 

For a SHI T '-concept D in NNF, the algorithm starts with a completion tree 
consisting of a single node x with L(x) = {D}. It applies the expansion rules, 
stopping when a clash occurs, and answers "D is satishable" iff the completion 
rules can be applied in such a way that they yield a complete and clash-free 
completion tree. 

The soundness and completeness of the tableaux algorithm is an immediate 



consequence of Lemmas 4.2 



and 4.4 



Lemma 4.4 Let D be an SriTT -concept. 

1. The tableaux algorithm terminates when started with D. 

2. If the expansion rules can be applied to D such that they yield a complete 
and clash-free completion tree, then D has a tableau. 

3. If D has a tableau, then the expansion rules can be applied to D such that 
they yield a complete and clash-free completion tree. 

Before we sketch the ideas of the proof, we will discuss the different expansion 
rules and their correspondence to the language constructors. 



The fl-, U-, 3- and V-rules are the standard ACC tableaux rules | Schmidt-Schaufi & Smolka, 1991 



The V+-rule is used to handle transitive roles, where the E -clause deals with 



the role hierarchy. See Horrocks & Sattler, 1999 for details 
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n-rule: if 1. C\ l~l C 2 G &(x), x is not indirectly blocked, and 
2. {d,C 2 }^L(x) 
then £(x) — > L{x) U {Ci, C 2 } 
U-rule: if 1. C\ U C2 G £(x), x is not indirectly blocked, and 
2. {C7i,C 2 }n£(a;) =0 
then, for some C G {C x , C 2 }, &(x) — > &(x) U {C} 
3-rule: if 1. 3S.C G £(21), a; is not blocked, and 

2. x has no ^-neighbour y with C G £(y) 
then create a new node y with 

£«x,y)) = and = {C} 

V-rule: if 1. VS.C G £(x), x is not indirectly blocked, and 
2. there is an S- neighbour y of x with C ^ L(y) 
then £(y) — ► U {C*} 

V^-rule: if 1. V5.C G <C(z), x is not indirectly blocked, 

2. there is some i? with Trans(i?) and i?ES, and 

3. x has an i?- neighbour y with MR.C ^ £>(y) 
then £(y) — * U {\/R.C} 

^-rule: if 1. 2 i?) G £(x), x is not blocked, and 

2. there is no i?-neighbour y of x with A G 
then create two new nodes y\, y 2 with 
H((x,yi))=L((x,y 2 )) = {R}, 
L( Vl ) = {A} and L(y 2 ) = {^A} 
^-rulc: if 1. 1 R) G £(x), x is not indirectly blocked, 

2. x has two i?-neighbours y and z s.t. y is not an ancestor of z, 
then 1. £(z) — ► £(2) U £(2/) and 

2. if z is an ancestor of y 

then £((z,x)) — > £((z,x)) U lnv(£((z, y))) 
else £((x, z)) — ► z)) U £((x, y)) 

3. £,((a!,y))— »0 

Figure 3: The tableaux expansion rules for STLXJ- 

The functional restriction rules merit closer consideration. In order to guar- 
antee the satisfaction of a ^2i?-constraint, the ^-rule creates two successors and 
uses a fresh atomic concept A to prohibit identification of these successors by 
the ^-rule. If a node x has two or more -R-neighbours and contains a functional 
restriction ^li?, then the ^-rule merges two of the neighbours and also merges 
the edges connecting them with x. Labelling edges with sets of roles allows 
a single node to be both an R and S'-successor of x even if R and S are not 
comparable by \± . Finally, contradicting functional restrictions are taken care 
of by the definition of a clash. 



We now sketch the main ideas behind the proof of Lemma 4.4: 
1. Termination: Let m = \sub(D)\ and n = \Rd\- Termination is a 
consequence of the following properties of the expansion rules: 
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(a) The expansion rules never remove nodes from the tree or concepts from 
node labels. Edge labels can only be changed by the ^-rule which either expands 
them or sets them to 0; in the latter case the node below the 0-labelled edge is 
blocked, (b) Successors are only generated for concepts of the form 3R.C and 
>2i?. For a node x, each of these concepts triggers the generation of at most 
two successors. If for one of these successors y the ^-rule subsequently causes 
£>((x, y)) to be changed to 0, then x will have some i?-neighbour z with &(z) 3 
&(y). This, together with the definition of a clash, implies that the concept that 
led to the generation of y will not trigger another rule application. Obviously, 
the out-degree of the tree is bounded by 2m. (c) Nodes are labelled with non- 
empty subsets of sub(D) and edges with subsets of Rd, so there are at most 
2 2mn different possible labellings for a pair of nodes and an edge. Therefore, on 
a path of length at least 2 2mn there must be 2 nodes x, y such that x is directly 
blocked by y. Since a path on which nodes are blocked cannot become longer, 
paths are of length at most 2 2mn . 

2. Soundness: A complete and clash-free tree T for D induces the existence 
of a tableaux T = (S, £, £) for D as follows. Individuals in S correspond to paths 
in T from the root node to some node that is not blocked. Instead of going to a 
directly blocked node, these paths jump back to the blocking node, which yields 
paths of arbitrary length. Thus, if blocking occurs, this construction yields an 
infinite tableau. This rather complicated tableau construction is necessary due 
to the presence of functional restrictions; its validity is ensured by the blocking 
condition, which considers both the blocked node and its predecessor. 

3. Completeness: A tableau T — (S,£,£) for D can be used to "steer" 
the application of the non-deterministic U- and ^-rules in a way that yields a 
complete and clash-free tree. 



The following theorem is an immediate consequence of Lemma 4.4 , Lemma 4.2 
and Lemma |2.3l 



Theorem 4.5 

The tableaux algorithm is a decision procedure for the satisfiability and sub- 
sumption of SHI T -concepts with respect to terminologies and role hierachies. 

4.2 A PSpace-algorithm for SX 

To obtain a PSPACE-algorithm for SI, the SHIT algorithm is modified as 
follows: (a) As SI does not allow for functional restrictions, the ^- and the ^- 
rule can be omitted; blocking no longer involves two pairs of nodes with identical 
labels but only two nodes with "similar" labels, (b) Due to the absence of role 
hierarchies, edge labels can be restricted to roles (instead of sets of roles), (c) 
To obtain a PSpace algorithm, we employ a refined blocking strategy which 
necessitates a second label H for each node. This blocking technique, while 



discovered independently, is based on ideas similar to those used in [Spaan, 1993 1 
to show that satisfiability for K4t can be decided in PSpace.^] In the following, 

3 The modal logic K4t is a syntactic variant of SI with only a single transitive role name. 
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we will describe and motivate this blocking technique; detailed proofs as well as 



a similar result for SIN can be found in [Horrocks et al, 1998 



Please note that naively using a cut rule does not yield a PSpace algorithm: a 



cut rule similar to the look behind analytical cut presented in [De Giacomo & Massacci, 1999] 
(non-deterministically) guesses which constraints will be propagated "up" the 
completion tree by universal restrictions on inverted roles. For SI, this tech- 
nique may lead to paths of exponential length due to equality blocking. A way 
to avoid these long paths would be to stop the investigation of a path at some 
polynomial bound. However, to prove the correctness of this approach, it would 



be necessary to establish a "short-path-model" property similar to Lemma 4.8 
Furthermore, we believe that our algorithm is better suited for an implementa- 
tion since it makes less use of "don't-know" non-determinism. This also distin- 



guishes our approach from the algorithm presented |5paan, 1993 , which is not 



intended to form the basis for an efficient implementation. 
Definition 4.6 

A completion tree for a SI concept D is a tree where each node x of the tree 
is labelled with two sets IB (a;) C <C(ar) C sub(D) and each edge (x,y) is labelled 
with a (possibly inverse) role L((x, y)) occurring in sub(D). 



R-neighbours, -successors, and -predecessors are defined as in Dchnition 4.5. 
Due to the absence of role hierarchies, QE is the identity on Rd ■ 
A node x is blocked iff, for an ancestor y, y is blocked or 

B(a;)C%) and £(s)/ lnv(5) = L(y)/ Inv(S), 

wherex' is the predecessor of x , L({x' ,x)) = S, andL(x)/ Inv(S) = {Vlnv(5).C G 
L{x)}. 

For a node x, L(x) is said to contain a clash iff {A, ->A} C L(x). A comple- 
tion tree to which none of the expansion rules given in Figure ^ is applicable is 
called complete. 

For an Si-concept D, the algorithm starts with a completion tree consisting 
of a single node x with H>(x) — L(x) = {D}. It applies the expansion rules in 
Figure f^, stopping when a clash occurs, and answers "D is satisfiable" iff the 
completion rules can be applied in such a way that they yield a complete and 
clash-free completion tree. 

As for SUIT, correctness of the algorithm is proved by first showing that a 
<SI-concept is satisfiable iff it has a tableau, and next proving the 5T-analogue 



of Lemma 4.4 



Theorem 4.7 

The tableaux algorithm is a decision procedure for satisfiability and subsump- 
tion of Si-concepts. 

The dynamic blocking technique for SI and STil described in Section ^|, 
which is based on label equality, may lead to completion trees with exponentially 
long paths because there are exponentially many possibilities to label sets on 
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n-rule: if 1. Ci n C 2 E and 
2. {C U C 2 } %H(x) 
then £(a;) — > L{x) U {Ci, C 2 } 
U-rule: if 1. Ci U C 2 E £>{x) and 

2. {C 1 ,C 2 }nH(x)=Q 

then — ► L{x) U {C} for some C £{d,C 2 } 
V-rule: if 1. VS.C E and 

2. there is an S-successor y of x with C ^ 23(y) 
then L(y) — > L(y) U {C} and 
2(y) — >£(?/) U{C} or 
2'. there is an S-predecessor y of x with C ^ 
then£(y) — ► U {C}. 
V+-rule: if 1. VS.C £ H(x) and Trans(S) and 

2. there is an S'-successor y of x with VS.C ^ H(y) 
then — ► L(y) U {VS.C} and 
2(y) — > U {VS.C} or 
2'. there is an S'-predecessor y of x with VS.C ^ £(y) 
then L{y) — ► L(y) U {VS.C}. 
3-rulc: if 1. 3S.C G a; is not blocked and no other rule 

is applicable to any of its ancestors, and 
2. x has no S-neighbour y with C E L(y) 
then create a new node y with 

L((x,y)) = S and = = {C} 

Figure 4: Tableaux expansion rules for ST 
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such a path. Due to the non-deterministic U-rule, these exponentially many sets 
may actually occur. 

This non-determinism is not problematical for S because disjunctions need 
not be completely decomposed to yield a subset-blocking situation. For an 
optimal ST algorithm, the additional label 25 was introduced to enable a sort 
of subset-blocking which is independent of the U-non-determinism. Intuitively, 
53 (x) is the restriction of L(x) to those non-decomposed concepts that x must 
satisfy, whereas L(x) contains boolean decompositions of these concepts as well 
as those that are imposed by value restrictions in descendants. If x is blocked 
by y, then all concepts in 25 (x) are eventually decomposed in L(y) (if no clash 
occurs). However, in order to substitute x by y, x's constraints on predecessors 
must be at least as strong as y's; this is taken care of by the second blocking 
condition. 

Let us consider a path xx,...,x n where all edges are labelled R with Trans(_R), 
the only kind of paths along which the length of the longest concept in the la- 
bels might not decrease. If no rules can be applied, we have £(xj+i)/ lnv(i?) C 
Ufa)/ lnv(i?) and H(xi) C 23(xj+i) U {Cj} (where BR.Ci triggered the genera- 
tion of Xi+x). This limits the number of labels and guarantees blocking after a 
polynomial number of steps. 

Lemma 4.8 The paths of a completion tree for a concept D have a length of 
at most m 4 where m = \sub(D)\. 

Finally, a slight modification of the expansion rules given in Figure |i| yields 
a PSpace algorithm. This modification is necessary because the original algo- 
rithm must keep the whole completion tree in its memory — which needs expo- 
nential space even though the length of its paths is polynomially bounded. The 
original algorithm may not forget about branches because restrictions which are 
pushed upwards in the tree might make it necessary to revisit paths which have 
been considered before. We solve this problem as follows: 

Whenever the V- or the V+-rule is applied to a node x and its predecessor y 
(Case 2' of these rules), we delete all successors of y from the completion tree. 
While this makes it necessary to restart the generation of successors for y, it 
makes it possible to implement the algorithm in a depth-first manner which 
facilitates the re-use of space. 

This modification does not affect the proof of soundness and completeness 



for the algorithm, but we have to re-prove termination Horrocks et aZ., 1998 
as it relied on the fact that we never removed any nodes from the completion 
tree. Summing up we get: 

Theorem 4.9 

The modified algorithm is a PSpace decision procedure for satisfiability and 
subsumption of ST-concepts. 
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5 The Undecidability of Unrestricted STiM 



In [ Horrocks et at, 1999[ we describe an algorithm for SH2Q based on the 



STiT J- -algorithm already presented. Like earlier DLs that combine a hier- 
archy of ( transitive and non-trans i tive) roles with some form of number re- 



strictions Horrocks fe Sattler, 1999, Horrocks et al, 1998 1 and SUIT, the DL 



STilQ allows only simple roles in number restrictions. The justification for this 
limitation has been partly on the grounds of a doubtful semantics (of transitive 
functional roles) and partly to simplify decision procedures. In this section we 
will show that, even for the simpler STLM logic, allowing arbitrary roles in num- 
ber restrictions leads to undecidability, while decidability for the corresponding 
variant of SJiTJ- is still an open problem. For convenience, we will refer to 
S7iM with arbitrary roles in number restrictions as S7iN + . 

The undecidability proof uses a reduction of the domino problem | Berger, 196(| 



adapted from [Baader & Sattler, 1996 . This problem asks if, for a set of domino 
types, there exists a tiling of an N 2 grid such that each point of the grid is 
covered with one of the domino types, and adjacent dominoes are "compatible" 
with respect to some predefined criteria. 

Definition 5.1 

A domino system V = (D,H,V) consists of a non-empty set of domino types 
D = {Di, . . . , D n }, and of sets of horizontally and vertically matching pairs 
H C D x D and V C D x D. The problem is to determine if, for a given T>, 
there exists a tiling of an N x N grid such that each point of the grid is covered 
with a domino type in D and all horizontally and vertically adjacent pairs of 
domino types are in H and V respectively, i.e., a mapping t : N x N — > D such 
that for all m, n £ N, (t(m, n),t{m + 1, n)) 6 H and (t{m, n), t(m, n + 1)) € V. 

This problem can be reduced to the satisfiability of iS7-L/V" + -concepts, and 
the undecidability of the domino problem implies undecidability of satisfiability 
of iS7iW + -concepts. 

Ensuring that a given point satisfies the compatibility conditions is simple 
for most logics (using value restrictions and boolean connectives) , and applying 
such conditions throughout the grid is also simple in a logic such as STLM + which 
can deal with arbitrary axioms. The crucial difficulty is representing the NxN 
grid using "horizontal" and "vertical" roles X and Y, and in particular forcing 
the coincidence ofloy and Y o X successors. This can be accomplished in 
S7iM + using an alternating pattern of two horizontal roles X\ and X2, and two 
vertical roles Y\ and Y%, with disjoint primitive concepts A, B, C , and D being 
used to identify points in the grid with different combinations of successors. The 
coincidence oi X oY and Y o X successors can then be enforced using number 
restrictions on transitive super-roles of each of the four possible combinations of 
X and Y roles. A visualisation of the resulting grid and a suitable role hierarchy 
is shown in Figure |5|, where S® are transitive roles. 

The alternation of X and Y roles in the grid means that one of the transitive 
super-roles 5® connects each point (x, y) to the points (x + 1, y), (x, y + 1) and 
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Figure 5: Visualisation of the grid and role hierarchy. 



(x+1, y+1), and to no other points. A number restriction of the form ^35® can 
thus be used to enforce the necessary coincidence of XoY and YoX successors. 
A complete specification of the grid is given by the following axioms: 

A c -<B n -iC n ->D n 3X V B n 3Y X .C n s$3S®, 
Bc^nnCnnfln 3A 2 .yl n 3Yl.D n <35Jf x , 
c c n -nB n -iD n 3Xi.B n ay 2 .A n ^35f 2 , 
DCninnBnnCn 3X 2 .C n ay 2 -B n <35 2 ® 2 . 

It only remains to add axioms which encode the local compatibility conditions 
(as described in [ Baadcr fc Sattlcr. 1996{ ) and to assert that A is subsumed by 
the disjunction of all domino types. The t S7YA/' + -concept A is now satisfiable 



w.r.t. the various axioms (which can be internalised as described in Lemma 2.3) 
iff there is a compatible tiling of the grid. 



6 Implementation and Optimisation 



The development of the ST family of DLs has been motivated by the desire to 
implement systems with good typical case performance. As discussed in Sec- 
tion this is achieved in part through the design of the logics and algorithms 
themselves, in particular by using transitive roles and by reasoning with number 
restrictions directly, rather than via encodings. Another important feature of 
these algorithms is that their relative simplicity facilitates the application of a 
range of optimisation technique s. Several system s base d on S logics have now 
been imple mented (e.g., FaCT |Horrocks, 1998a) , DLP jPatel-Schneider, 1998 | 
and RACE | HaarslevfcMoller, 1999 |), and have demonstrated that suitable op- 
timisation techniques can lead to a dramatic improvement in the performance 
of the algorithms when used in realistic applications. A system based on the 
SHTT logic has also been implemented (iFaCT [ Horrocks, 1999| ) and has been 
shown to be similarly amenable to optimisation. 

DL systems are typically used to classify a KB, and the optimisation tech- 
niques used in such systems can be divided into four categories based on the 
stage of the classification process at which they are applied. 
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= {( A U B ): ( A U C)} fx) 

£(:r) U {A} =>■ clash f^\ {x) &(x)U{B} 

Z(x) U {A} =► clash fjA (x\ L(x)u{C} 

Figure 6: Syntactic branching search 

1. Preprocessing optimisations that try to modify the KB so that classifica- 
tion and subsumption testing are easier. 

2. Partial ordering optimisations that try to minimise the number of sub- 
sumption tests required in order to classify the KB. 

3. Subsumption optimisations that try to avoid performing a potentially ex- 
pensive satisfiability test, usually by substituting a cheaper test. 

4. Satisfiability optimisations that try to improve the typical case perfor- 
mance of the underlying satisfiability testing algorithm. 

Many optimisations in the first three categories are relatively independent of the 
underlying subsumption (satisfiability) testing algorithm and could be applied 
to any DL system. As we are mostly concerned with algorithms for the SI 
family of DLs we will concentrate on the fourth kind of optimisation, those that 
try to improve the performance of the algorithm itself. Most of these are aimed 
at reducing the size of the search space explored by the algorithm as a result of 
applying non-deterministic tableaux expansion rules. 

6.1 Semantic branching search 

Implementations of the algorithms described in the previous sections typically 
use a search technique called syntactic branching. When expanding the label 
of a node x, syntactic branching works by choosing an unexpanded disjunction 
(Ci U . . . U C n ) in £j(x) and searching the different models obtained by adding 
each of the disjuncts C%, . . . , C n to £j(x) ]Giunchiglia fc Scbastiani, 1996 1 . As 



the alternative branches of the search tree are not disjoint, there is nothing to 
prevent the recurrence of an unsatisfiablc disjunct in different branches. The 
resulting wasted expansion could be costly if discovering the unsatisfiability re- 
quires the solution of a complex sub-problem. For example, tableaux expansion 
of a node x, where {(A\JB), (AuC)} C Z(x) and A is an unsatisfiable concept, 
could lead to the search pattern shown in Figure |[ in which the unsatisfiability 
of L(x) U {A} must be demonstrated twice. 

This problem can be dealt with by using a semantic branching technique 
adapted from the Davis-Putnam-Logemann-Loveland proced ure (PPL) com- 



monly used to solve prepositional satisfiability (SAT) problems [Davis et ai, 1962 
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Freeman, 1996]. Instead of choosing an unexpanded disjunction in £j(x), a sin- 



gle disjunct D is chosen from one of the unexpanded disjunctions in L(x). The 
two possible sub-trees obtained by adding either D or -<D to Z(x) are then 
searched. Because the two sub-trees are strictly disjoint, there is no possibility 
of wasted search as in syntactic branching. Note that the order in which the 
two branches are explored is irrelevant from a theoretical viewpoint, but may 



offer further optimisation possibilities (see Section 6.4) 



Semantic branching search has the additional advantage that a great deal is 
known about the implementation and optimisation of the DPL algorithm. In 
particular, both local simplification (see Section 6.2) and heuristic guided search 
(see Section 6.4) can be used to try to minimise the size of the search tree 
(although it should be noted that both these techniques can also be adapted for 
use with syntactic branching search). 

There are also some disadvantages to semantic branching search. Firstly, it 
is possible that performance could be degraded by adding the negated disjunct 
in the second branch of the search tree, for example if the disjunct is a very large 
or complex concept. However this does not seem to be a serious problem in prac- 
tice, with semantic branching rarely exhibiting significantly worse performance 
than syntactic branching. Secondly, its effectiveness is problem dependent. It 
is most effective with randomly generated problems, particularly those that are 



over-constrained (likely to be unsatisfiable) [Horrocks & Patel-Schneider, 1999 1 . 
It is also effective with some of the hand crafted problems from the Tableaux'98 
benchmark suite Hcucrding& Schwcndimann, 1996, Balsiger fc Heuerding, 199^ 



However it is of little benefit when classifying realistic KBs [Horrocks & Patel-Schneider, 1998 



6.2 Local simplification 

Local simplification is another technique used to reduce the size of the search 
space resulting from the application of non-deterministic expansion rules. Before 
any non-deterministic expansion of a node label L(x) is performed, disjunctions 
in £j(x) are examined, and if possible simplified. The simplification most com- 
monly used is to detcrministically expand disjunctions in £j(x) that present only 
one expansion possibility and to detect a clash when a disjunction in L(x) has 
no expansion possibi lities. This simp lification has been called boolean constraint 
propagation (BCP) | Freeman, 1995 . In effect, the inference rule 



-■Ci,... ,^C n ,C 1 U...UC n UD 
D 

is being used to simplify the conjunctive concept represented by £j(x). For 
example, given a node x such that 

{(C U (£>! n Da)), (-Da U ^D 2 U C), -,C} C L{x), 

BCP detcrministically expands the disjunction (Cl_l(DinD2)), adding (D1IID2) 
to L(x), because -iC € L(x). The deterministic expansion of (Di n D2) adds 
both D\ and D2 to £>(x), allowing BCP to identify (->Di U -1D2 U C) as a clash 
(without any branching having occurred), because {Di, D2, —C} C Ju(x). 
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L(x) = {(AuB),(AuG)} g 

U {^} ^ c/as/i (x) &(x)\J{->A,B,C} 

Figure 7: Semantic branching search 



BCP simplification is usually described as an integral part of SAT based al- 
gorithms [ Giunchiglia fc Scbastiani, 1996[ , but it can also be used with syntactic 



branching. However, it is more effective with semantic branching as the negated 
concepts introduced by failed branches can result in additional simplifications. 
Taking the above example of {(A U B), (A U C)} C L(x), adding ->A to L(x) 
allows BCP to deterministically expand both of the disjunctions using the sim- 
plifications (A U B) and ->A — > B and (A U C) and ->A — > C. The reduced search 
space resulting from the combination of semantic branching and BCP is shown 
in Figure 0. 

Local simplification has the advantage that it can never increase the size 
of the search space and can thus only degrade performance to the extent of 
the overhead required to perform the simplification. Minimising this overhead 



does, however, require complex data structures Freeman, 1995 1, particularly in 
a modal/description logic setting. 

As with semantic branching, effectiveness is problem dependent, the opti- 
misation being most effective with over-constrained randomly generated prob- 
lems [ Horrocks fc Patcl-Schneider, 1998| . 



6.3 Dependency directed backtracking 

Inherent unsatisfiability concealed in sub-problems can lead to large amounts 
of unproductive backtracking search, sometimes called thrashing. For example, 
expanding a node x (using semantic branching), where 

L(x) = {(C l UD 1 ),... ,(C n UD n ),3R.(AnB),VR.^A}, 

could lead to the fruitless exploration of 2™ possible i?-successors of x before 
the inherent unsatisfiability is discovered. The search tree resulting from the 
tableaux expansion is illustrated in Figure |8[ 

This problem can be addressed by adapting a form of dependency directed 
backtracking called backjumping, which has been used in solving constraint 
satisfiability problems [ Baker, 1995| (a similar t echnique was also used in the 



HARP theorem prover Oppacher fc Suen, 198§[]). Backjumping works by la- 



belling each concept in a node label with a dependency set indicating the 
branching points on which it depends. A concept C G fl(x) depends on a 
branching point if C was added to L(x) at the branching point or if C G £j(x) 
was generated by an expansion rule (including simplification) that depends on 
another concept D G £j(y), and D G £>(y) depends on the branching point. A 
concept C G £>(x) depends on a concept D G &(y) when C was added to L(x) 
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Figure 8: Thrashing in backtracking search 



by a deterministic expansion that used D 6 £(?/)• For example, if A 6 

was derived from the expansion of (An B) e then A £ L(x) depends on 

(AnB)e L(x). 

When a clash is discovered, the dependency sets of the clashing concepts can 
be used to identify the most recent branching point where exploring the other 
branch might alleviate the cause of the clash. It is then possible to jump back 
over intervening branching points without exploring any alternative branches. 
Let us consider the earlier example and suppose that 3R.(A n B) has a depen- 
dency set Di and VR.^A has a dependency set Dj. The search proceeds until 
C\ . . . C n have been added to when 3R.(A n B) and \/R.-*A are deter- 

ministically expanded and a clash occurs in £j(y) between the A derived from 
3R.(A\1B) and the -*A derived from MR.^A. As these derivations were both de- 
terministic, the dependency sets will be and Dj respectively, and so UDj 
is returned. This set cannot include the branching points where C\ . . . C n were 
added to £(x) as and T)j were defined before these branching points were 
reached. The algorithm can therefore backtrack through each of the preceding 
n branching points without exploring the second branches, and will continue to 
backtrack until it reaches the branching point equal to the maximum value in 
Di UDj (if Di = Dj = 0, then the algorithm will backtrack through all branch- 
ing points and return "unsatisfiable" ) . Figure |^ illustrates the pruned search 
tree, with the number of i?-successors explored being reduced by an exponential 
number. 

Backjumping can also be used with syntactic branching, but the procedure is 
slightly more complex as there may be more than two possible choices at a given 
branching point, and the dependency set of the disjunction being expanded must 
also be taken into account. 

Like local simplification, backjumping can never increase the size of the 
search space. Moreover, it can lead to a dramatic reduction in the size of 
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clash 



Figure 9: Pruning the search using backjumping 



the search tree and thus a huge performance improvement. For example, when 
using cither FaCT or DLP with backjumping disabled in order to classify a large 



(«3,000 concept) KB derived from the European Galen project | Rector et a/., 1993 1 
single satisfiability tests were encountered that could not be solved even after 
several weeks of CPU time. Classifying the same KB with backjumping enabled 



takes less than 100s of CPU time for either FaCT or DLP porrocks k Patel-Schneider, 199S ] 

Backjumping's only disadvantage is the overhead of propagating and storing 
the dependency sets. This can be alleviated to some extent by using a pointer 
based implementation so that propagating a dependency set only requires the 
copying of a pointer. 

6.4 Heuristic guided search 

Heuristic techniques can be used to guide the search in a way that tries to 
minimise the size of the search tree. A method that is widely used in DPL 
SAT algorithms is to branch on the disjunct that has the Maximum number of 
Occurrences in disjunctions of Minimum Size — the well known MOMS heuris- 



tic [Freeman, 1995]. By choosing a disjunct that occurs frequently in small dis- 
junctions, the MOMS heuristic tries to maximise the effect of BCP. For example, 
if the label of a node x contains the unexpanded disjunctions CUDi, . . . , CUD n , 
then branching on C leads to their deterministic expansion in a single step: when 
C is added to £(x), all of the disjunctions are fully expanded and when ->C is 
added to L(x), BCP will expand all of the disjunctions, causing Di, . . . , D n to 
be added to L(x). Branching first on any of D\,.., ,D n , on the other hand, 
would only cause a single disjunction to be expanded. 

The MOMS value for a candidate concept C is computed simply by count- 
ing the number of times C or its negation occur in minimally sized disjunctions. 
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There are several variants of this heuristic, including the heuristic from Jeroslow 
and Wang [ Jeroslow & Wang, 199C]. The Jeroslow and Wang heuristic considers 
all occurrences of a disjunct, weighting them according to the size of the dis- 
junction in which they occur. The heuristic then selects the disjunct with the 
highest overall weighting, again with the objective of maximising the effect of 
BCP and reducing the size of the search tree. 

When a disjunct C has been selected from the disjunctions in &(x), a BCP 
maximising heuristic can also be used to determine the order in which the 
two possible branches, L(x) U {C} and L(x) U {^C}, are explored. This is 
done by separating the two components of the heuristic weighting contributed 
by occurrences of C and -iC, trying L{x) U {C} first if C made the smallest 
contribution, and trying L(x) U {^C} first otherwise. The intention is to prune 
the search tree by maximising BCP in the first branch. 

Unfortunately MOMS-style heuristics can interact adversely with the back- 
jumping optimisation because they do not take dependency information into 
account. This was first discovered in the FaCT system, when it was noticed 
that using MOMS heuristic often led to much worse performance. The cause 
of this phenomenon turned out to be the fact that, without the heuristic, the 
data structures used in the implementation naturally led to "older" disjunctions 
(those dependent on earlier branching points) being expanded before "newer" 
ones, and this led to more effective pruning if a clash was discovered. Using 
the heuristic disturbed this ordering and reduced the effectiveness of backjump- 
ing [Horrocks, 1997]. 

Moreover, MOMS-style heuristics are of little value themselves in descrip- 
tion logic systems because they rely for their effectiveness on finding the same 
disjuncts recurring in multiple unexpanded disjunctions: this is likely in hard 
propositional problems, where the disjuncts are prepositional variables, and 
where the number of different variables is usually small compared to the num- 
ber of disjunctive clauses (otherwise problems would, in general, be trivially 
satisfiable) ; it is unlikely in concept satisfiability problems, where the disjuncts 
are (possibly non-atomic) concepts, and where the number of different concepts 
is usually large compared to the number of disjunctive clauses. As a result, these 
heuristics will often discover that all disjuncts have similar or equal priorities, 
and the guidance they provide is not particularly useful. 

An alternative strategy is to employ an oldest-first heuristic that tries to 
maximise the effectiveness of backjumping by using dependency sets to guide 
the expansion | Horrocks fc Patel-Schneider, 1999 1 . When choosing a disjunct on 
which to branch, the heuristic first selects those disjunctions that depend on the 
least recent branching points (i.e., those with minimal maximum values in their 
dependency sets), and then selects a disjunct from one of these disjunctions. 
This can be combined with the use of a BCP maximising heuristic, such as the 
Jeroslow and Wang heuristic, to select the disjunct from amongst the selected 
disjunctions. 

The oldest-first heuristic can also be used to advantage when selecting the 
order in which existential role restrictions, and the labels of the i?-successors 
which they generate, are expanded. One possible technique is to use the heuristic 
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to select an uncxpanded existential role restriction 3R.C from the label of a 
node x, apply the 3-rule and the V-rule as necessary, and expand the label of 
the resulting i?-successor. If the expansion results in a clash, then the algorithm 
will backtrack; if it does not, then continue selecting and expanding existential 
role restrictions from L(x) until it is fully expanded. A better technique is to 
first apply the 3-rule and the V-rule exhaustively, creating a set of successor 
nodes. The order in which to expand these successors can then be based on 
the minimal maximum values in the dependency sets of all the concepts in their 
label, some of which may be due to universal role restrictions in L(x). 

The main advantage of heuristics is that they can be used to complement 
other optimisations. The MOMS and Jeroslow and Wang heuristics, for ex- 
ample, are designed to increase the effectiveness of BCP while the oldest-first 
heuristic is designed to increase the effectiveness of backjumping. They can also 
be selected and tuned to take advantage of the kinds of problem that arc to 
be solved (if this is known). The BCP maximisation heuristics, for example, 
are generally quite effective with large randomly generated and hand crafted 
problems, whereas the oldest- first heuristic is more effective when classifying 
realistic KBs. 

Unfortunately heuristics also have several disadvantages. They can add a 
significant overhead as the heuristic function may be expensive to evaluate and 
may need to be reevaluated at each branching point. Moreover, they may not 
improve performance, and may significantly degrade it, for example by interact- 
ing adversely with other optimisations, by increasing the frequency with which 
pathological worst cases can be expected to occur in generally easy problem 
sets. 

6.5 Caching satisfiability status 

During a satisfiability check there may be many successor nodes created. Some 
of these nodes can be very similar, particularly as the labels of the i?-successors 
for a node x each contain the same concepts derived from the universal role 
restrictions in L(x). Systems such as DLP take advantage of this similarity by 
caching the satisfiability status of the sets of concepts with which node labels 
are initialised when they are created. The tableaux expansion of a node can 
then be avoided if the satisfiability status of its initial set of concepts is found 
in the cache. 

However, this technique depends on the logic having the property that the 
satisfiability of a node is completely determined by its initial label set, and, due 
to the possible presence of inverse roles, SI logics do not have this property. 
For example, if the expansion of a node x generates an i?-successor node y, with 
Z(y) — {Vi?".C}, then the satisfiability of y clearly also depends on the set of 
concepts in L(x). Similar problems could arise in the case where L(y) contains 
number restriction concepts. 

If it is possible to solve these problems, then caching may be a very effective 
technique for ST logics, as it has been shown to be in the DLP system with a 
logic that does not support inverse roles. Caching is particularly useful in KB 
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classification as cached values can be retained across multiple satisfiability tests. 
It can also be effective with both satisfiablc and unsatisfiable problems, unlike 
many other optimisation techniques that are primarily aimed at speeding up 
the detection of unsatishability. 

The main disadvantage with caching is the storage overhead incurred by 
retaining node labels (and perhaps additional information in the case of ST 
logics) and their satisfiability status throughout a satisfiability test (or longer, 
if the results arc to be used in later satisfiability tests). An additional problem is 
that it interacts adversely with the backjumping optimisation as the dependency 
information required for backjumping cannot be effectively calculated for nodes 
that are found to be unsatisfiable as a result of a cache lookup. Although the 
set of concepts in the initial label of such a node is the same as that of the 
expanded node whose (un)satisfiability status has been cached, the dependency 
sets attached to the concepts that made up the two labels may not be the same. 
However, a weaker form of backjumping can still be performed by taking the 
dependency set of the unsatisfiable node to be the union of the dependency sets 
from the concepts in its label. 



7 Discussion 



A new DL system is being implemented based on the STCIQ algorithm we have 

~[|Horrocks et al., 1999 1 



4.1 



developed from the iSHX^-algorithm described in Section 
Pending the completion of this project, the existing FaCT system Horrocks, 1998b 
has been modified to deal with inverse roles using the STtHF blocking strategy, 
the resulting system being referred to as iFaCT. 

iFaCT has been used to conduct some initial experiments with a terminology 
representing (fragments of) database schemata and inter schema assertions from 
a data warehousing application [Calvanese et al., 1998a (a slightly simplified 
version of the proposed encoding was used to generate STLIT terminologies) . 
iFaCT is able to classify this terminology, which contains 19 concepts and 42 
axioms, in less than 0.1s of (266MHz Pentium) CPU time. In contrast, eliminat- 
ing inverse roles using an embedding technique Calvanese et al., 19981] gives an 
equisatisfiable FaCT terminology with an additional 84 axioms, but one which 
FaCT is unable to classify in 12 hours of CPU time. As discussed in Section [| an 
extension of the embedding technique can be used to eliminate number restric- 
tions [ Dc Giacomo fe Lcnzcrini, 1995| ], but requires a target logic which supports 



the transitive closure of roles, i.e., converse-PDL. The even larger number of 
axioms that this embedding would introduce makes it unlikely that tractable 
reasoning could be performed on the resulting terminology. Moreover, we are not 
aware of any algorithm for converse-PDh which does not employ a so-called look 
behind analytical cut [De Giacomo & Massacci, 1999], the application of which 
introduces considerable additional non-determinism. It seems inevitable that 
this would lead to a further degradation in empirical tractability. 

The DL SHIQ will allow the above mentioned encoding of database schemata 
to be fully captured using qualified number restrictions. Future work will include 
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completing the implementation of the SJiTQ, algorithm, testing its behaviour 
in this kind of application and investigating new techniques for improving its 
empirical tractability. 
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